NSF PAR Search | NSF Public Access Repository

Federated target trial emulation using distributed observational data for treatment effect estimation

https://doi.org/10.1038/s41746-025-01803-y

Li, Haoyang; Zang, Chengxi; Xu, Zhenxing; Pan, Weishen; Rajendran, Suraj; Chen, Yong; Wang, Fei (December 2025, npj Digital Medicine)

Free, publicly-accessible full text available December 1, 2026

Multicenter target trial emulation to evaluate corticosteroids for sepsis stratified by predicted organ dysfunction trajectory

https://doi.org/10.1038/s41467-025-59643-z

Rajendran, Suraj; Xu, Zhenxing; Pan, Weishen; Zang, Chengxi; Siempos, Ilias; Torres, Lisa; Xu, Jie; Bian, Jiang; Schenck, Edward J; Wang, Fei (December 2025, Nature Communications)

Free, publicly-accessible full text available December 1, 2026

Identifying progression subphenotypes of Alzheimer’s disease from large-scale electronic health records with machine learning

https://doi.org/10.1016/j.jbi.2025.104820

Zhou, Manqi; Tang, Alice S; Zhang, Hao; Xu, Zhenxing; Ke, Alison MC; Su, Chang; Huang, Yu; Mantyh, William G; Jaffee, Michael S; Rankin, Katherine P; et al (May 2025, Journal of Biomedical Informatics)

Free, publicly-accessible full text available May 1, 2026

An adaptive federated learning framework for clinical risk prediction with electronic health records from multiple hospitals

https://doi.org/10.1016/j.patter.2023.100898

Pan, Weishen; Xu, Zhenxing; Rajendran, Suraj; Wang, Fei (January 2024, Patterns)

Full Text Available

Data heterogeneity in federated learning with Electronic Health Records: Case studies of risk prediction for acute kidney injury and sepsis diseases in critical care

https://doi.org/10.1371/journal.pdig.0000117

Rajendran, Suraj; Xu, Zhenxing; Pan, Weishen; Ghosh, Arnab; Wang, Fei (March 2023, PLOS Digital Health)
Frasch, Martin G. (Ed.)
With the wider availability of healthcare data such as Electronic Health Records (EHR), a growing number of data-driven approaches have been proposed to improve the quality of care delivery. Predictive modeling, which aims to build computational models for predicting clinical risk, is a popular research topic in healthcare analytics. However, privacy concerns about healthcare data may hinder the development of effective, generalizable predictive models, because such models often require rich, diverse data from multiple clinical institutions. Recently, federated learning (FL) has demonstrated promise in addressing this concern. However, data heterogeneity across the participating local sites may affect the prediction performance of federated models. Because acute kidney injury (AKI) and sepsis are highly prevalent among patients admitted to intensive care units (ICU), AI-based early prediction of these conditions is an important topic in critical care medicine. In this study, we take AKI and sepsis onset risk prediction in the ICU as two examples to explore the impact of data heterogeneity in the FL framework and to compare performance across frameworks. We built predictive models based on local, pooled, and FL frameworks using EHR data from multiple hospitals. The local framework used only data from each site itself. The pooled framework combined data from all sites. In the FL framework, each local site did not have access to other sites’ data. A model was updated locally, and its parameters were shared with a central aggregator, which used them to update the federated model’s parameters and then shared the updated parameters with each site. We found that models built within the FL framework outperformed their local counterparts. We then analyzed variable importance discrepancies across sites and frameworks. Finally, we explored potential sources of heterogeneity within the EHR data. Differences in the distributions of demographic profiles, medication use, and site information contributed to data heterogeneity.
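The local-update and central-aggregation loop described in the abstract can be illustrated with a minimal sketch. It assumes a logistic regression model and sample-size-weighted parameter averaging (the FedAvg scheme); the abstract does not state which model or aggregation rule the authors used, and the function names, hyperparameters, and synthetic two-site data below are all hypothetical.

```python
import math
import random


def local_update(weights, X, y, lr=0.1, epochs=5):
    """One site's local training: a few gradient-descent epochs on logistic loss.

    Only the resulting parameters leave the site; X and y stay local.
    """
    w = list(weights)
    n = len(X)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            z = sum(wj * xj for wj, xj in zip(w, xi))
            p = 1.0 / (1.0 + math.exp(-z))
            for j, xj in enumerate(xi):
                grad[j] += (p - yi) * xj / n
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


def aggregate(site_weights, site_sizes):
    """Central aggregator: sample-size-weighted average of site parameters (assumed rule)."""
    total = sum(site_sizes)
    dim = len(site_weights[0])
    return [
        sum(w[j] * n for w, n in zip(site_weights, site_sizes)) / total
        for j in range(dim)
    ]


def federated_train(sites, dim, rounds=20):
    """Alternate local updates and aggregation, then broadcast the new global model."""
    global_w = [0.0] * dim
    for _ in range(rounds):
        updates = [local_update(global_w, X, y) for X, y in sites]
        global_w = aggregate(updates, [len(X) for X, _ in sites])
    return global_w


def make_site(n, shift, seed):
    """Synthetic site whose feature distribution is shifted to mimic heterogeneity."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        x = rng.gauss(shift, 1.0)
        X.append([1.0, x])  # intercept term plus one feature
        y.append(1 if x + rng.gauss(0, 0.5) > 0 else 0)
    return X, y


sites = [make_site(200, -0.5, 1), make_site(100, 0.5, 2)]
global_w = federated_train(sites, dim=2)
```

The two synthetic sites draw their feature from different distributions, which is a toy stand-in for the demographic and medication-use differences the study identifies as sources of heterogeneity.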
Full Text Available

Sepsis subphenotyping based on organ dysfunction trajectory

https://doi.org/10.1186/s13054-022-04071-4

Xu, Zhenxing; Mao, Chengsheng; Su, Chang; Zhang, Hao; Siempos, Ilias; Torres, Lisa K.; Pan, Di; Luo, Yuan; Schenck, Edward J.; Wang, Fei (December 2022, Critical Care)

Abstract

Background: Sepsis is a heterogeneous syndrome, and the identification of clinical subphenotypes is essential. Although organ dysfunction is a defining element of sepsis, subphenotypes of differential trajectory are not well studied. We sought to identify distinct Sequential Organ Failure Assessment (SOFA) score trajectory-based subphenotypes in sepsis.

Methods: We created 72-h SOFA score trajectories in patients with sepsis from four diverse intensive care unit (ICU) cohorts. We then used dynamic time warping (DTW) to compute heterogeneous SOFA trajectory similarities and hierarchical agglomerative clustering (HAC) to identify trajectory-based subphenotypes. Patient characteristics were compared between subphenotypes, and a random forest model was developed to predict subphenotype membership at 6 and 24 h after ICU admission. The model was tested on three validation cohorts. Sensitivity analyses were performed with alternative clustering methodologies.

Results: A total of 4678, 3665, 12,282, and 4804 unique sepsis patients were included in the development cohort and the three validation cohorts, respectively. Four subphenotypes were identified in the development cohort: Rapidly Worsening (n = 612, 13.1%), Delayed Worsening (n = 960, 20.5%), Rapidly Improving (n = 1932, 41.3%), and Delayed Improving (n = 1174, 25.1%). Baseline characteristics, including the pattern of organ dysfunction, varied between subphenotypes. Rapidly Worsening was defined by a higher comorbidity burden, acidosis, and visceral organ dysfunction. Rapidly Improving was defined by vasopressor use without acidosis. Outcomes differed across the subphenotypes: Rapidly Worsening had the highest in-hospital mortality (28.3%, P-value < 0.001), despite a lower SOFA score (mean: 4.5) at ICU admission compared to Rapidly Improving (mortality: 5.5%, mean SOFA: 5.5). An overall prediction accuracy of 0.78 (95% CI, [0.77, 0.80]) was obtained at 6 h after ICU admission, which increased to 0.87 (95% CI, [0.86, 0.88]) at 24 h. Similar subphenotypes were replicated in the three validation cohorts. The majority of patients with sepsis have an improving phenotype with a lower mortality risk; however, they make up over 20% of all deaths due to their larger numbers.

Conclusions: Four novel, clinically-defined, trajectory-based sepsis subphenotypes were identified and validated. Identifying trajectory-based subphenotypes has immediate implications for the powering and predictive enrichment of clinical trials. Understanding the pathophysiology of these differential trajectories may reveal unanticipated therapeutic targets and identify more precise populations and endpoints for clinical trials.
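The pairing of dynamic time warping with hierarchical agglomerative clustering named in the Methods can be sketched as follows. This is an illustration only: the absolute-difference local cost, the average-linkage criterion, and the toy trajectories are assumptions, since the abstract does not specify the authors' distance or linkage settings.

```python
def dtw_distance(a, b):
    """Classic dynamic time warping distance between two numeric sequences.

    Sequences may have different lengths; the local cost is the absolute difference.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def agglomerative_clusters(series, k):
    """Average-linkage agglomerative clustering on pairwise DTW distances (assumed linkage)."""
    n = len(series)
    dist = [[dtw_distance(series[i], series[j]) for j in range(n)] for i in range(n)]
    clusters = [[i] for i in range(n)]
    while len(clusters) > k:
        best = None
        for p in range(len(clusters)):
            for q in range(p + 1, len(clusters)):
                link = sum(dist[i][j] for i in clusters[p] for j in clusters[q])
                link /= len(clusters[p]) * len(clusters[q])
                if best is None or link < best[0]:
                    best = (link, p, q)
        _, p, q = best
        clusters[p] = clusters[p] + clusters[q]  # merge the closest pair
        del clusters[q]
    return [sorted(c) for c in clusters]


# Hypothetical SOFA-like score trajectories of unequal length.
trajectories = [
    [8, 7, 5, 3, 2],     # improving
    [8, 8, 6, 4, 2, 2],  # improving, sampled at more time points
    [3, 4, 6, 8, 9],     # worsening
    [3, 3, 5, 7, 9, 9],  # worsening, sampled at more time points
]
groups = agglomerative_clusters(trajectories, k=2)
```

DTW is useful here because it tolerates trajectories of different lengths and timing, so the two improving sequences end up grouped together even though they are not sampled identically.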
Full Text Available

Artificial intelligence for COVID-19: battling the pandemic with computational intelligence

https://doi.org/10.1016/j.imed.2021.09.001

Xu, Zhenxing; Su, Chang; Xiao, Yunyu; Wang, Fei (February 2022, Intelligent Medicine)

Full Text Available

Deep learning in mental health outcome research: a scoping review

https://doi.org/10.1038/s41398-020-0780-3

Su, Chang; Xu, Zhenxing; Pathak, Jyotishman; Wang, Fei (December 2020, Translational Psychiatry)
Full Text Available

Order-Preserving Metric Learning for Mining Multivariate Time Series

https://doi.org/10.1109/ICDM50108.2020.00080

Xu, Jie; Xu, Zhenxing; Yu, Bin; Wang, Fei (November 2020, Proceedings of the 2020 IEEE International Conference on Data Mining (ICDM))
Full Text Available

Federated Patient Hashing

https://doi.org/10.1609/aaai.v34i04.6121

Xu, Jie; Xu, Zhenxing; Walker, Peter; Wang, Fei (June 2020, Proceedings of the AAAI Conference on Artificial Intelligence)
Privacy concerns about sharing sensitive data across institutions are paramount in the medical domain and hinder the research and development of many applications, such as cohort construction for cross-institution observational studies and disease surveillance. Moreover, the large volume and heterogeneity of patient data pose great challenges for retrieval and analysis. To address these challenges, we propose a Federated Patient Hashing (FPH) framework, which collaboratively trains a retrieval model stored in a shared memory while keeping all patient-level information in local institutions. Specifically, the objective function is constructed by minimizing a similarity preserving loss and a heterogeneity digging loss, which preserves both inter-data and intra-data relationships. Then, by leveraging the concept of Bregman divergence, we implement optimization in a federated manner in both centralized and decentralized learning settings, without accessing the raw training data across institutions. We also analyze the convergence rate of the FPH framework. Extensive experiments on a real-world clinical data set from critical care demonstrate the effectiveness of the proposed method for similar patient matching across institutions.
Full Text Available
